Development of An External Cluster Validity Index using Probabilistic Approach and Min-max Distance
نویسندگان
چکیده
Validating a given clustering result is a very challenging task in real world. So for this purpose, several cluster validity indices have been developed in the literature. Cluster validity indices are divided into two main categories: external and internal. External cluster validity indices rely on some supervised information available and internal validity indices utilize the intrinsic structure of the data. In this paper a new external cluster validity index, MMI and its normalized version NMMI have been implemented based on Max-Min distance along data points and prior information using structure of data. A new probabilistic approach has been implemented to find the correct correspondence between the true and obtained clustering. Different possibilities for probabilistic approaches have been considered and tried to rectify their problems. Genetic K-means clustering algorithm (GAK-means) and single linkage clustering technique have been used as the underlying clustering techniques. Results of proposed index for classifying the true partitioning results have been shown for six artificial and two real-life data sets. GAK-means and single linkage clustering techniques are used as the underlying partitioning techniques with the number of clusters varied in a range. The MMI and NMMI index are then used to determine the appropriate number of clusters. Performance of MMI along with its two versions MMI old and MMI new along with its normalized version NMMI are compared with the existing external cluster validity indices, F-measure, purity, normalized mutual information (NMI), rand index (RI), adjusted rand index (ARI). Proposed MMI index works well for two class and multi class data sets.
منابع مشابه
A Facility Location Problem with Tchebychev Distance in the Presence of a Probabilistic Line Barrier
This paper considers the Tchebychev distance for a facility location problem with a probabilistic line barrier in the plane. In particular, we develop a mixed-integer nonlinear programming (MINLP) model for this problem that minimizes the total Tchebychev distance between a new facility and the existing facilities. A numerical example is solved to show the validity of the developed model. Becau...
متن کاملAnalysis of Tourist Cluster in Mazandaran Using SWOT Approach
Clusters are geographically close groups of related companies or institutions related to a certain area which are inherently more efficient than the other companies due to advantages such as being located in one place, networks, external knowledge, variability of human capital, etc. Today, development through clusters plays a pivotal role in the economic and industrial policies of developed...
متن کاملIdentification of Power Stripping Resources with Fuzzy Cluster Dynamic Approach (Case Study: West Azerbaijan Province)
Reducing electric power theft is a significant part of the potential benefits of implementing the concept of smart grid. This paper proposes a data-based approach to identify locations with unusual electricity consumption. The new distance-based method classifies the new data as violator costumers, if their distance is long to the primary consumption data. The proposed algorithm determines the ...
متن کاملPsychometric Analysis of Hypertension Self-Management Behaviors Questionnaire; an Application of Intervention Mapping Approach in Questionnaire Development
Aims: High blood pressure is one of the common main preventable risk factors for many diseases. This study aimed to psychometric properties of the cognitive determinants of hypertension self-management questionnaire among Iranian hypertensive patients based on the Intervention Mapping approach. Instrument & Methods: This psychometric study was conducted in Abadan in 2019. Content Validity Rati...
متن کاملThe Role of Knowledge Management Elements in the Improvement of the Faculty Members in Distance Education Universities) designing an appropriate model (
Background and Objective: Given the importance and status of faculty members in universities, the advancement of the duties and missions of the higher education system and rapid development of the technologies and challenges faced by educational institutions require proper measures for the continuous development and overall improvement of these systems, especially the improvement of the capacit...
متن کامل